CDS

Accession Number TCMCG061C12336
gbkey CDS
Protein Id XP_042050203.1
Location join(32427083..32427175,32427241..32427276,32427358..32427470,32428085..32428130,32429130..32429176,32429411..32429518,32430127..32430202,32430738..32430810,32430910..32431055,32431127..32431212,32431550..32431689,32431762..32431901,32431986..32432036,32432695..32432758,32434467..32434610,32434681..32434748,32434857..32434915,32435587..32435743,32436062..32436114,32436195..32436293,32437366..32437480,32437553..32437675,32438195..32438359)
Gene LOC121795694
GeneID 121795694
Organism Salvia splendens
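
The Location field above can be checked against the protein length reported in this record. The following is a minimal sketch, assuming the intervals are 1-based and inclusive as in GenBank-style feature locations. The helper name `exon_lengths` is hypothetical and used only for illustration.

```python
import re

# Location string copied from the CDS record above.
location = (
    "join(32427083..32427175,32427241..32427276,32427358..32427470,"
    "32428085..32428130,32429130..32429176,32429411..32429518,"
    "32430127..32430202,32430738..32430810,32430910..32431055,"
    "32431127..32431212,32431550..32431689,32431762..32431901,"
    "32431986..32432036,32432695..32432758,32434467..32434610,"
    "32434681..32434748,32434857..32434915,32435587..32435743,"
    "32436062..32436114,32436195..32436293,32437366..32437480,"
    "32437553..32437675,32438195..32438359)"
)

def exon_lengths(loc):
    """Hypothetical helper: length of each start..end interval (1-based, inclusive)."""
    return [int(end) - int(start) + 1
            for start, end in re.findall(r"(\d+)\.\.(\d+)", loc)]

lengths = exon_lengths(location)
cds_length = sum(lengths)

print(len(lengths))                    # number of joined intervals
print(cds_length)                      # total CDS length in nucleotides
print(cds_length == (733 + 1) * 3)     # 733 codons plus one stop codon
```

The summed interval length matches a 733 aa protein plus a stop codon, which is consistent with the Length field in the Protein section.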

Protein

Length 733 aa
Molecule type protein
Topology linear
Data_file_division PLN
dblink BioProject:PRJNA737421
db_source XM_042194269.1
Definition DNA mismatch repair protein MSH4 isoform X3 [Salvia splendens]

EGGNOG-MAPPER Annotation

COG_category L
Description MutS family domain IV
KEGG_TC -
KEGG_Module -
KEGG_Reaction -
KEGG_rclass -
BRITE ko00000, ko03400
KEGG_ko ko:K08740
EC -
KEGG_Pathway -
GOs -

Sequence

CDS:  
ATGGTAACTATTGTTACACCTACAAAGCTAGCACCAGACGGCATGGTTGGAGTCTCAGAGCTGGTCGATAAATTTTGCTCTTCATCCACAAAGGTTATAATGGCCCGTGGTTGCTTTGATGACACAAGGGGGGCTGCGCTGGTGAAAAACTTGGCAGCTAATGAACCATCTGCTCTTGGTCTGGAGACTTATTCCAAACAATATTATCTTTGCTTGGCTGCAGCTGCTGCAACTATCAAGTGGATAGAAGCTGAGAAAGGGTTGATCATCACAAATCACTCACTGTCAGTCACATTTAATGGATCATTTGACCACATGAATATAGACGCTACTAGTGTCCAAAACTTGGAAATTATAGAGCCGATGCACTCTTGTCTTTGGGGCTCTAACAACAAGAAGAGAAGTTTATTTCACATTCTCAAAACAACAAGGACAGTGGGAGGCACCAGACTTTTGCGCGCCAATCTTTTGCAGCCCCTAAAAAACATTGAGACTATCAATGCCCGTCTAGATTGTCTTGATGAGCTAATGAGCAATGAGCAACTATTTTTTGGCCTCTCTCAGGCTCTCCGTAAGTTTCCAAAAGAAACTGATAAGGTCCTCTGTCATTTCTGCTTCAAGCAAAAGAAAGTTACCAATGAAGTCTTGACCAGTGACACTTCCAGAAAGAGCCAAATCTTGATATCAAGCATTATTCTTCTTAAAACAGCTCTTGATTCCTTGCCACTACTCTCCAAGGTGCTTAAGGATGCAAATTCTTACCTATTCAAAAATATATACAAGTCCATATGCGATAATGAAAAGTTTACTACTATGCGAACAAGGATAGGAGAGGTCATTGATGACGATGTTCTTCATGCGCGTGTTCCTTTTGTTGCTCGAACTCAACAGTGTTTTGCTGTCAAGGCAGGAATTGATGGACTTCTAGATATTGCTCGGAGATCATTCTGTGATACTAGTGAAGCAATATACAACTTGGCAAACAAATACAGGGAGGACTTTAAGCTGCCAAATTTGAAAATCCCATATAATAGCAGACAAGGTTTTTACTTTAACATACCTCAGAAGGAAATACAAGGAAAACTTCCCAACAAGTTTATCCAGGTCAATAGACATGGAAACAACATACATTGTTCTTCTTTGGAACTGGCCTCTTTGAATGTAAGAAACAAGTCTGCAGCTAAAGAATGCTACATCCGGACAGAGTGTTGCCTGGAGGAACTGATGGAAGACATACGGAAGGATGTCTCTCAGCTCACTTTTCTTGCTGAGGTTTTATGCCTTCTCGATATGATAGTCAATTCATTTGCTCATACAATATCAACGAAGCCAGATGAAAAATACACCAGACCAAGATTTACATATGATGGACCATTGGCAATTGATTCAGGAAGACATCCCATTCTTGAAAACGTACACAATGAGTTTGTGGCCAACAACATTTTTCTTTCTGAAGCATCAAATATGGTAGTTGTAACGGGCCCAAACATGAGTGGGAAGAGTACTTATCTTCAGCAAGTTTGCCTGGTAGTCATCCTCGCTCAAATTGGGTGTTACGTTCCTGCTCAGTTTGCAACTTTGAGAGTAGTTGATCGCATATTTACTAGGATGGGAACTATGGACAGTGTTGAATCAAATTCTAGCTCGTTTATGACCGAGATGAAAGAGACTGCTTTTATCCTGCAAAATGCTTCTCATAGAAGTCTGATTGTTGTAGATGAATTAGGGAGAGCAACATCTTCCTCTGATGGGTTTGCAATTGCTTGGAGCTGTTGTGAACATCTGCTGGCTTTAAAAGCGTACACCATATTTGCTACTCATATGGAGAACTTATCTGAGCTGGCCACAACATATCCAAATGTGAAAATTGTACACTTTGATGTCGAGATCAAGAATAAGCACATGGATTTCAAGTTTCAACTGAAAGATGGGCCCCGAACTGTAGCGCACTATGGCCTTATGCTAGCAAGCGTAGCTGGACTACCGATTCCAGTGATAGA
GTTGGCCAAAAGTATCACATCAAAGATTACACAGAAGGAAGCAGAGAGAATACAGATCAGCTTTTGTAAGCATCATGATCTTCAAATGGCATATCGTGTTGCTCAACGACTTATATGTCTGAAATTCTCTAACCAAGACGAAGACTCTATCCGAGCAGCACTGCAGAATCTCAAGGAATGCTGTATTCAGGGAGGCCTTTGA
Protein:  
MVTIVTPTKLAPDGMVGVSELVDKFCSSSTKVIMARGCFDDTRGAALVKNLAANEPSALGLETYSKQYYLCLAAAAATIKWIEAEKGLIITNHSLSVTFNGSFDHMNIDATSVQNLEIIEPMHSCLWGSNNKKRSLFHILKTTRTVGGTRLLRANLLQPLKNIETINARLDCLDELMSNEQLFFGLSQALRKFPKETDKVLCHFCFKQKKVTNEVLTSDTSRKSQILISSIILLKTALDSLPLLSKVLKDANSYLFKNIYKSICDNEKFTTMRTRIGEVIDDDVLHARVPFVARTQQCFAVKAGIDGLLDIARRSFCDTSEAIYNLANKYREDFKLPNLKIPYNSRQGFYFNIPQKEIQGKLPNKFIQVNRHGNNIHCSSLELASLNVRNKSAAKECYIRTECCLEELMEDIRKDVSQLTFLAEVLCLLDMIVNSFAHTISTKPDEKYTRPRFTYDGPLAIDSGRHPILENVHNEFVANNIFLSEASNMVVVTGPNMSGKSTYLQQVCLVVILAQIGCYVPAQFATLRVVDRIFTRMGTMDSVESNSSSFMTEMKETAFILQNASHRSLIVVDELGRATSSSDGFAIAWSCCEHLLALKAYTIFATHMENLSELATTYPNVKIVHFDVEIKNKHMDFKFQLKDGPRTVAHYGLMLASVAGLPIPVIELAKSITSKITQKEAERIQISFCKHHDLQMAYRVAQRLICLKFSNQDEDSIRAALQNLKECCIQGGL
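
The CDS and Protein sequences above can be spot-checked against each other by translating the ends of the CDS. The following is a minimal sketch, assuming the standard genetic code (NCBI translation table 1). The variables `cds_head` and `cds_tail` hold only the first 30 and last 15 nucleotides of the CDS, copied from the record, and the function name `translate` is hypothetical.

```python
from itertools import product

# Standard genetic code (assumed: NCBI translation table 1), bases in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(codon): aa
               for codon, aa in zip(product(BASES, repeat=3), AMINO)}

def translate(nucleotides):
    """Hypothetical helper: translate an in-frame DNA string codon by codon."""
    return "".join(CODON_TABLE[nucleotides[i:i + 3]]
                   for i in range(0, len(nucleotides) - 2, 3))

# First 30 and last 15 nucleotides of the CDS shown above.
cds_head = "ATGGTAACTATTGTTACACCTACAAAGCTA"
cds_tail = "CAGGGAGGCCTTTGA"

print(translate(cds_head))  # MVTIVTPTKL
print(translate(cds_tail))  # QGGL*
```

The translated head matches the start of the Protein sequence (MVTIVTPTKL), and the tail matches its last four residues followed by a stop codon.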